AITopics | input-output mapping

Neural Information Processing Systems http://nips.cc/

matrix, rnn, urnn, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Norfolk County > Wellesley (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Add feedback

Input-Output Equivalence of Unitary and Contractive RNNs

Neural Information Processing SystemsDec-25-2025, 19:03:45 GMT

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This works shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states. The theoretical results are supported by experiments on modeling of slowly-varying dynamical systems.

input-output equivalence, name change, unitary and contractive rnn, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Add feedback

f21e255f89e0f258accbe4e984eef486-AuthorFeedback.pdf

Neural Information Processing SystemsOct-9-2025, 15:41:33 GMT

artificial intelligence, machine learning, matrix factorization, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Input-Output Equivalence of Unitary and Contractive RNNs

Melikasadat Emami, Mojtaba Sahraee Ardakan, Sundeep Rangan, Alyson K. Fletcher

Neural Information Processing SystemsOct-3-2025, 07:38:17 GMT

When the transition matrix has an induced norm greater than one, the RNN may become unstable.

matrix, rnn, urnn, (14 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Norfolk County > Wellesley (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Add feedback

9c449771d0edc923c2713a7462cefa3b-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 07:38:03 GMT

constraint, contractivity constraint, reviewer, (17 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)

Add feedback

Input-Output Equivalence of Unitary and Contractive RNNs

Neural Information Processing SystemsOct-10-2024, 14:38:01 GMT

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This works shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states.

input-output equivalence, input-output mapping, unitary and contractive rnn, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

The Mean Dimension of Neural Networks -- What causes the interaction effects?

Hahn, Roman, Feinauer, Christoph, Borgonovo, Emanuele

arXiv.org Machine LearningJul-11-2022

Owen and Hoyt recently showed that the effective dimension offers key structural information about the input-output mapping underlying an artificial neural network. Along this line of research, this work proposes an estimation procedure that allows the calculation of the mean dimension from a given dataset, without resampling from external distributions. The design yields total indices when features are independent and a variant of total indices when features are correlated. We show that this variant possesses the zero independence property. With synthetic datasets, we analyse how the mean dimension evolves layer by layer and how the activation function impacts the magnitude of interactions. We then use the mean dimension to study some of the most widely employed convolutional architectures for image recognition (LeNet, ResNet, DenseNet). To account for pixel correlations, we propose calculating the mean dimension after the addition of an inverse PCA layer that allows one to work on uncorrelated PCA-transformed features, without the need to retrain the neural network. We use the generalized total indices to produce heatmaps for post-hoc explanations, and we employ the mean dimension on the PCA-transformed features for cross comparisons of the artificial neural networks structures. Results provide several insights on the difference in magnitude of interactions across the architectures, as well as indications on how the mean dimension evolves during training.

artificial intelligence, machine learning, mean dimension, (16 more...)

arXiv.org Machine Learning

2207.0489

Country:

North America > Canada > Ontario > Toronto (0.14)
Europe > Italy > Lombardy > Milan (0.04)

Genre: Research Report > New Finding (0.46)

Industry: Water & Waste Management > Solid Waste Management (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Input-Output Equivalence of Unitary and Contractive RNNs

Emami, Melikasadat, Ardakan, Mojtaba Sahraee, Rangan, Sundeep, Fletcher, Alyson K.

Neural Information Processing SystemsMar-19-2020, 03:02:43 GMT

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This works shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states.

input-output equivalence, input-output mapping, unitary and contractive rnn, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Add feedback

Input-Output Equivalence of Unitary and Contractive RNNs

Emami, M., Sahraee-Ardakan, M., Rangan, S., Fletcher, A. K.

arXiv.org Machine LearningOct-30-2019

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive is the unitary constraint on the possible input-output mappings of such a network? This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states. The theoretical results are supported by experiments on modeling of slowly-varying dynamical systems.

contractive rnn, rnn, urnn, (16 more...)

arXiv.org Machine Learning

1910.13672

Country: North America > United States > Massachusetts > Norfolk County > Wellesley (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)

Add feedback

Interpreting Layered Neural Networks via Hierarchical Modular Representation

Watanabe, Chihiro

arXiv.org Machine LearningOct-3-2018

Interpreting the prediction mechanism of complex models is currently one of the most important tasks in the machine learning field, especially with layered neural networks, which have achieved high predictive performance with various practical data sets. To reveal the global structure of a trained neural network in an interpretable way, a series of clustering methods have been proposed, which decompose the units into clusters according to the similarity of their inference roles. The main problems in these studies were that (1) we have no prior knowledge about the optimal resolution for the decomposition, or the appropriate number of clusters, and (2) there was no method with which to acquire knowledge about whether the outputs of each cluster have a positive or negative correlation with the input and output dimension values. In this paper, to solve these problems, we propose a method for obtaining a hierarchical modular representation of a layered neural network. The application of a hierarchical clustering method to a trained network reveals a tree-structured relationship among hidden layer units, based on their feature vectors defined by their correlation with the input and output dimension values.

artificial intelligence, feature vector, machine learning, (14 more...)

arXiv.org Machine Learning

1810.01588

Country: Asia > Japan (0.28)

Genre: Research Report > New Finding (0.34)

Technology: